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Abstract 

Through a Metropolis-like algorithm with single step computational 
cost of order one, we build a Markov chain that relaxes to the canoni- 
cal Fermi statistics for k non-interacting particles among m energy levels. 
Uniformly over the temperature as well as the energy values and degen- 
eracies of the energy levels we give an explicit upper bound with leading 
term km In k for the mixing time of the dynamics. We obtain such con- 
struction and upper bound as a special case of a general result on (non- 
homogeneous) products of ultra log-concave measures (like binomial or 
Poisson laws) with a global constraint. As a consequence of this general 
result we also obtain a disorder-independent upper bound on the mixing 
time of a simple exclusion process on the complete graph with site dis- 
order. This general result is based on an elementary coupling argument, 
illustrated in a simulation appendix and extended to (non-homogeneous) 
products of log-concave measures. 
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1 From the Fermi statistics to general condi- 
tional products of log-concave measures 

1.1 Sampling the Fermi statistics 

Given two positive integers k and to, given a non-negative real number /?, given 
to real numbers v\, . . . , v m and given to integers m, . . . , n m such that 

m 

n := ^ rij > k, (1.1) 

i=i 

the canonical Fermi statistics at inverse temperature (3 for A: non-interacting 
particles among the to energy levels 1, . . . , to, with energy values V\, . . . , w m 
and degeneracies n\, . . . , n m is the conditional probability measure on 

X k , m := {(fei, . . . , fc m ) € N m : fci + • • • + k m = k} (1.2) 

given by 

v := ii{-\X ktm ) (1.3) 
with \i the product measure on N m such that 



1 m /nA 

//(fci, . . . , k m ) := - Y[ y k 3 J exp{-(3kjVj}, 



(1.4) 



Z: = E II W exp{-/3^}, (1.5) 
fci,...,fc m j=i V j/ 

with (™ j ) =0 whenever ftj > n^. In other words, ^ is a (non- homogeneous) 
product of binomial laws in k\ , . . . , k m with the global constraint 

h + --- + k m = k (1.6) 

and we can write 

v(k u ...,k m ) = -l[e^^\ (k 1 ,...,k m )eX k>m (1.7) 

where the 0j are defined by 

: lj £ N 4 -/SfcjUj + In {^j elU {-oo} (1.8) 

and Q is such that v is a probability measure. 

The first aim of this paper is to describe an algorithm that simulates a 
sampling according to v in a time that can be bounded from above by an 
explicit polynomial in k and to, uniformly over (3, {vj)i<j< m and («j)i<j<m- 
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The reason why we prefer a bound in k and m rather than in the 'volume' of 
the system n = J2j n ji wm De clarified later. 

A first naive (and wrong) idea to do so consists in choosing the position (the 
energy level) of a first, second, . . . and eventually k th particle in the following 
way. First choose randomly the position of the first particle according to the 
exponential weights associated with the 'free entropies' of the empty sites, that 
is choose level j with a probability proportional to exp{— fivj + In rij}. Then 
decrease by 1 the degeneracy of the chosen energy level and repeat the procedure 
to choose the position of the second, third, . . . and eventually fc th particle. It 
is easy to check that, doing so, the final distribution of the occupation numbers 
fci, . . . , k m associated with the different energy levels, that is of the numbers 
of particles placed in each level, is in general not given by v as soon as k 
is larger than one. But it turns out that this naive idea can be adapted to 
build an efficient algorithm to perform approximate samplings under the Fermi 
statistics. 

Very classically, the fast sampling performed by the algorithm we will build 
will be obtained by running a Markov chain X with transition matrix p on 
Xk,m and with equilibrium measure v. The efficiency of the algorithm will be 
measured through the bounds that we will be able to give on the mixing time t e , 
defined for any positive e < 1 by 

t e := inf{i > : d(t) < e} (1.9) 
d(t) := max ||p*(»?, ■) - i/||tv (1-10) 

where || • ||tv stands for the total variation distance defined for any probability 
measures v\ and v 2 on Xk,m by 

||^i - z^||tv := max \vi(A) - vi{A)\ = \ V | z^i (r?) - iy 2 (v)\- (1-U) 

As a consequence, estimating mixing times is not the only one issue of this 
paper, building a 'good' Markov chain is part of the problem. 

As far as that part of the problem is concerned, we propose to build a 
Metropolis-like algorithm that uses the 'free energies' of the naive approach to 
define a conservative dynamics. Assuming that at time i £ N the system is 
in some configuration X t = r/ in Xk, m with 1/(77) > 0j and defining for any 
r/ = (fci, . . . , k m ) and any distinct i and j in {1; • • • ; m} 

{k s for se{l;-;m} \ {i; j} 

h-1 ifa = i 
kj + 1 if s = j 

(1.12) 

the configuration at time t+1 will be decided as follows: 

• choose a particle with uniform probability (it will stand in a given level i 
with probability fcj/fc), 
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• choose an energy level with uniform probability (a given level j will be 
chosen with probability 1/m), 

• with i the level where stood the chosen particle and j the chosen energy 
level, extract a uniform variable U on [0; 1) and set X t +\ = rfi if % ^ j 
and 

U < exp{-/3«j + \n(nj - kj) + fiv t - ln(nj - {h ~ 1))}, (1.13) 
Xt+i = r\ if not. 

In other words, denoting by [a] + = (a + |a|)/2 the positive part of any real 
number a and with 

ipj : kj e {Q; • • • ; rij} t-> —/3vj + ln(nj — fcj) 6lU {-oo}, j e {1; • • • ; m}, 

(1.14) 

for any distinct i and j in {1; • • • ; m} 

P{X t+ i=rfi\X t = r,)=p(ri,ri^) = |i exp{-fo&i(fci - 1) - Vi(^)] + }, (1.15) 
and 

PpQ +1 = = 77) = pfa, 77) = 1 - fa. (1-16) 

Remark: In order to avoid any ambiguity in (|1.15p in the case k{ — 0, we set 
1) = +00 (even though the algorithm we described does not require any 
convention for rpi(— 1)). 

This Markov chain is certainly irreducible and aperiodic. To prove that it 
relaxes to v we will check the reversibility of the process with respect to v. Then 
we will have to estimate the mixing time of the process. We will carry out both 
the tasks in a more general setup. 

1.2 A general result 

For any function / : N — > K we dchnc 

V+/ : x e N ^ V+/ := f(x + 1) - f(x) (1.17) 

V-/ : x e N \ {0} ^ V-/ := f(x - 1) - /(a) (1.18) 
A/:x£N\{0}4 A,/ := V+/ + V~/ = -V^(V+/) (1.19) 
and we say that a measure 7 on the integers 

7:i6Nh>e* ) gl + , (1.20) 

with <j) : N — > K U {—00}, is log-concave if N \ 7 _1 ({0}) is an interval of the 
integers and 

l{xf > 7(1 -1)7(1 + 1), ieN\{0}, (1.21) 
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i.e., if V + </> is non-increasing, or, equivalently, —A<fi is non-negative (with the 
obvious extension of the previous definitions to such a possibly non- finite </>). 
The measure \i defined in (jl.4[) is a product of log-concave measures and the 
canonical Fermi statistics is such a product measure normalized over the condi- 
tion pTB|) . 

N.B. From now on, and except for explicit mentioning of additional hypotheses, 
we will only assume that the probability measure v we want to sample is a product 
of log- concave measures normalized over the global constraint \l.b\) . i.e., that v 
is a probability on Xk,m that can be written in the form Jj.Tp with non-increasing 
V + 4> 3 'a. 

In this more general setup we will often refer to the indices j in {1; • • • ; m} as 
sites rather than energy levels of the system. 

Actually the e^ j 's of the Fermi statistics are much more than log-concave 
measures. They are ultra log-concave measures according to the following defi- 
nition by Pemantle [2] and Liggett [3]. 

Definition 1.2.1 A measure 7 : N — > R+ is ultra log-concave if x\ r y{x) is 
log-concave. 

In other words e^ j is ultra log-concave if and only if 

:= V+<fc + ln(l + •) (1.22) 

is non-increasing (for the Fermi statistics observe that so are the <pj 's and that 
(fL~22j) is consistent with ([L~Ti]> ), 

For birth and death processes that are reversible with respect to ultra log- 
concave measures, Caputo, Dai Pra and Posta [10] proved modified log-Sobolev 
inequalities and stronger convex entropy decays, both giving good upper bounds 
on the mixing time of the processes. Johnson [13] proved also easier Poincare 
inequalities that give weaker bounds on the mixing times. We refer to [3], 
0, [S] for an introduction to this classical functional inequality approach to 
convergence to equilibrium and we note that for such birth and death processes 
the ultra-log concavity hypotheses allowed for a Bakry-Fmery like approach 
(see pQ) to derive (modified) log-Sobolev inequalities. Actually, [10] was an 
attempt to extend this celebrated analysis to Markov processes with jumps (see 
also |15) for a more geometric perspective). But it turns out, as we will soon 
review, that beyond the case of birth and death processes the role of ultra log- 
concavity to follow the Bakry-Emery line in the discrete setup is still unclear. 

In this paper we will not follow the functional analysis approach to control 
mixing times. To bound from above the mixing time of a Markov chain that is 
reversible with respect to a conditional product of ultra log-concave measures, 
we will follow (and recall in Section [2]) the non less classical, and, in this case, 
elementary, probabilistic approach via coalescent coupling. We will prove: 
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Theorem 1 Iff derives from a product of ultra log-concave measures, then the 
Markov chain with transition matrix p defined by 

p(v,v ij ) = ^-eM-mh - i) - 

rC 777 

r) = (ki,..., k m ) e Xk,m \ ^ _1 {o}, 
Mie{i;-;m}. t 1 - 23 ) 

pfor?) = l-^pfa,^') (1.24) 

is reversible with respect to v and, for any positive e < 1, its mixing time t e 
satisfies 

t e < km\n(k/e). (1.25) 

Proof: sec Section [2l 

The most relevant point of Theorem Q] with respect to the previous results 
we know stands in the uniformity of the upper bound above the disorder of 
the system (except for the ultra log-concavity hypothesis on in each j). In 
particular and as far as the Fermi statistics is concerned, our estimate does 
not depend on the temperature, and, more generally it is independent from the 
energy values as well as the level degeneracies. 

To illustrate this fact let us start with the case rij = 1 for all j. In this case 
our dynamics is a simple exclusion process with site disorder. Caputo ([6], [12]) 
proved Poincare inequalities for such processes, in their continuous time version, 
assuming a uniform lower (and upper) bound on general transition rates, while 
Caputo, Dai Pra and Posta [10], looking at particular rates for the process and 
still assuming moderate disorder - that is, uniform lower and upper bounds 
on these rates - proved a modified log-Sobolev inequality. For the particular 
choice of rates they made, the upper bound on the mixing time implied by [10] 
could not hold in a strong disorder context (for example with k = 1, m = 3, 
v\ = v% = 0, t>3 > and fj 3> 1). Our uniformity over the disorder of the system 
depends then strongly on our particular choice for the transition probabilities. 
As it is often the case with Markov processes on discrete state space, the details 
of the dynamics are not less important than the properties of its equilibrium 
measure. 

Under the moderate disorder hypotheses of [6], [10], [12], however, the upper 
bound on the mixing time implied by [TO] and suggested by [6], [12] is better 
than our upper bound in Theorem [1] (by a factor of order k). But we will see 
that introducing such moderate disorder hypotheses in our arguments directly 
improves our result by a factor of order m > k (see the last remark at the end 
of Section [2]). 

To close the discussion on simple exclusion processes we note that those of 
the arguments in [T^] that do not depend on the disorder suggest an upper 
bound on the mixing time of order m 2 lnm. Theorem [T] improves this estimate 
when k is small with respect to m. 
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As far as more general conditional non-homogeneous product of log-concave 
measures are concerned, we stress once again that Theorem [1] gives a uniform 
bound over the disorder that can be improved by a factor of order m by adding 
moderate disorder hypotheses (see the last remark at the end of Section fOl and 
our simulation Appendix). Then, such product measures can be equilibrium 
measures of zero-range processes with a continuous time generator as in [8], 
HQ]: 

where the Cj : kj G N H> [0, +oo) are such that Cj-(0) = and Cj(kj) > for 
fcj > 0. Indeed, such a process is reversible with respect to v provided that 

Vj G {1; • • • ; m}, Vfc, G N, = ]T In . (1.27) 

0<l<kj ^ ' 

Boudou, Caputo, Dai Pra and Posta [5] proved a Poincare inequality for such a 
process, assuming that there exists a positive c such that 

Vj G {1; ■ • • ; m}, Vfcj G N, c 3 (fc 3 + 1) - Cj (k 3 ) > c. (1.28) 

This is a moderate disorder hypothesis that implies the log-concavity of the \ij . 
To go to modified log-Sobolev inequalities, i.e., to good mixing time estimates 
rather than simple gap estimates, there is the additional hypothesis in [10] that 
there exists a non-negative 6 < c such that 

Vj G {1; • • • ; to}, Vfcj G N, Cj(kj + 1) - Cj(kj) <c + 5. (1.29) 

Clearly there are ultra log-concave measures that do not satisfy (|1.29l) : to have 
an ultra log-concave measure one needs a strongly decreasing V + </>j, i.e., a 
strongly increasing cj. Conversely, it is not true that (|1.28|) and (|1.29|) imply 
ultra log-concavity, i.e., 

Vj G {!;...;*>}, V*, > 1, In < - ■ (1.30) 

However, elementary algebra shows that (11.28)) together with (|1.29p implies 

VjG { l,..;rn},V fc ,>l,ln^±^<ln^| y (1.31) 

and this strangely looks like (|1.30|) . Therefore we said that the role played by 
ultra log-concavity is still unclear along the functional analysis line of research 
in the discrete setup. We conclude stressing once again that in this paper we 
will follow a different line, that our uniform bound on the mixing time comes 
from an elementary coupling argument and that we do not need any moderate 
disorder hypothesis. 
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1.3 Interpolating between sites and particles 

It seems that today available techniques are such that the less log-concavity 
we have, the more homogeneity we need to control the convergence to equi- 
librium. Staying to the papers mentioned above, without ultra log-concavity 
or at least something that looks like ultra log-concavity we only have Poincare 
inequalities for non-homogeneous product of log-concave measures, and with- 
out log-concavity we have modified log-Sobolev inequalities for homogeneous 
product measures only (see PH]). In addition, the only result we know for a 
conservative dynamics in (weakly) disordered context and with an equilibrium 
measure that is a product of measures that are not log-concave is that of Landim 
and Noronha Neto [7] for the (continuous) Ginzburg-Landau process. 

We will see that all the ideas of the proof of Theorem [T] can be extended to 
deal with a large class of conditional product of log-concave measures that are 
not ultra log-concave. To do so, let us define 

f 1 + k 

5 := max < A G [0; 1] : Vj G {1; ■ • • ; m}, V% > 0, -A^fc > A In 3 

(1.32) 

In other words, S is the largest real number in [0; 1] for which all the 

if := V+0 i + 51n(l + -), je{l;-;m}, (1.33) 

are non-increasing. Denoting by a A b the minimum of two real numbers a, b 
and defining 

h := k s (k A m) 1 - 5 (1.34) 

we will prove: 

Theorem 2 If 5 > then the Markov chain with transition matrix p defined 
by 

p(v,V ij ) = ^--exp\-Uf ] (k t -l)-^f ] (k 3 ) 
l s m I L ■> 

1] = (ki, . . . , k m ) & Xk,m \ ^ _1 {0}i 
' ' ./ {I:--- :»<!• (1.35) 
p(r),T]) = 1 - ^p(ry,?7 l - J ) (1.36) 

is reversible with respect to v and, for any positive e < 1, its mixing time t e 
satisfies 

U < {kA ™ ]1 ~ d km\n(k/e). (1.37) 
o 

Remark 1: By Holder's inequality, if 5 > then 

m m / m \ 5 / m \ 1—5 

E fc ' = E ^{^o} < E *0 ( £ - h (L38) 

i=l i=l ^ i=l ' ^ i=l ' 
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and this ensures that (|1.35[) - (|1.36p define a probability matrix. 

Remark 2: As far as this can make sense in our discrete setup, we note that 

the hypothesis S > is slightly weaker than a "uniform strict log-concavity 

hypothesis" (see (|T3"2]) ). 

Proof of Theorem 2: see Section [3J 

For S = 1 the transition matrix represents an algorithm starting with a 
uniform choice of a particle. For 6 = Theorem [2] is empty but (| 1 .35[) - (| 1 .36[) 
still define a Markov chain X that can be seen as a particular version of a 
discrete state space non-homogeneous Ginzburg-Landau process. In this case 
the transition matrix represents an algorithm that starts with a uniform choice 
of a (non-empty) site. The case < 6 < 1 can be seen as an interpolation 
between uniform choices of site and particle. More precisely, assuming that 
at time t £ N the system is in some configuration X t = rj = (k\, . . . , k m ) in 
Xk.m \ v _1 {0}, the configuration at time t + 1 can be decided as follows. 

• Choose a site i or no site at all with probabilities proportional to kf and 

• If some site i was chosen, then proceed as in the previously described 
algorithm using the functions ip^ instead of the tpj's, if n °tj then set 
X t +i = V- 

1.4 Last remarks and original motivation 

First, we note that as long as one wants bounds that are uniform over the 
disorder, Theorem 1 gives the right order for the mixing time: for the Fermi 
statistic in the very low temperature regime with, for example, v\ < v-i < ■ ■ ■ < 
v m and ni, ri2 > fc, the equilibrium measure will be concentrated on (fc, 0, . . . , 0) 
while, starting from (0, k, 0, . . . , 0), the system will reach the ground state in a 
time of order km In k for large k (this is a coupon-collector estimate) . This fact 
is illustrated in our simulation Appendix. 

Next, we observe that, with our definitions, p(rj, 77) can often be close to one, 
especially in strong disorder situations or when the right and left hand sides 
in (|1.38[) are far from each other. If one would like to use these results to perform 
practical simulations, then it could be useful to note that the computational 
time would still be improved by implementing an algorithm that at each step 
simulates, for a given configuration 77 on the trajectory of the Markov chain, 
the elapsed time before the particle reach a different configuration (this is a 
geometric time) and choose this configuration rf =/= 77 according to the (easy to 
compute) associated law. It would then be enough to stop the algorithm as soon 
as the total simulated time goes beyond the mixing time (and then return the 
last configuration, that the system occupied at the mixing time) rather than 
waiting for the original algorithm to make a step number equal to the mixing 
time. 

Turning back to the first naive and wrong idea, it is interesting to note that 
it can easily be modified to determine the most probable states of the system, 
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i.e., the configurations 77* = (k J, . . . , 



k^) in A?fe,m such that 



7 M 



m 



max 

feiH hfc m =. 



(1.39) 



One can prove, using the concavity of the <j>j's, that the most probable configu- 
rations for the system with k particles can be obtained from the most probable 
configurations (/c 1; . . . , k' m ) for the system with k — 1 particles simply adding 
one particle where the corresponding gain in 'free energy' is the highest, that is 
in j* such that 



As a consequence one can place the particles one by one, each time maximizing 
this free energy gain, to build the most probable configuration. 

Then, as a referee pointed out, since the sampling problem is a trivial one 
in the Poissonian case (when all the fij's are Poisson measures, so that v is 
nothing but a multinomial law M), one can ask about the expected time needed 
to perform a rejection sampling with respect to the multinomial case. It is 
given by max, v(r))/M(r)). If we take the Fermi statistics with m — k and all 
the rij equal to 1, then we find (optimizing on the multinomial parameters by 



a geometric/arithmetic mean comparison) k k /k\ ~ e fc /V27rfc. Of course, the 
sampling problem for that Fermi statistics is also a trivial one. But if we take 
j3 = and all the rij equal to 2, it is not anymore a trivial problem and we find 
an expected time for the rejection sampling that is logarithmically equivalent 



Turning back to the Fermi statistics we now explain why we were interested 
in bounds in k and m rather than in the volume n. Iovanella, Scoppola and 
Scoppola defined in [IT] an algorithm to individuate cliques (i.e., complete sub- 
graphs) with k vertices inside a large Erdos-Reyni random graph with n vertices. 
Their algorithm requires to perform repeated approximate samplings of Fermi 
statistics in volume n, with k particles and m = 2k + 1 energy levels. Now, the 
key observation is that the largest cliques in Erdos-Reyni graphs with n vertices 
are of order Inn, so that k and m in this problem are logarithmically small with 
respect to n. Before Theorem [1] the samplings for their algorithm were done by 
running simple exclusion processes with k particles on the complete graph (with 
site disorder) of size n. Such processes converge to equilibrium in a time of order 
fcnlnfc. Now the samplings are done in a time of order kmhik ~ 2fc 2 lnfc, and 
that was the original motivation of the present work. 

Finally we note that our bound in k and m is not only useful when k is small 
with respect to n but also when k is close to n. In this case we can define a 
dynamics on the n — k vacancies rather than on the k particles. 

2 Proof of Theorem 1 

In this section we assume that v is a measure on Xk,m deriving from a product 
of ultra log-concave measures, which means that we can write u(ki, . . . , k m ) = 




= max 

i 




(1.40) 



3 



to (e/2) k . 



10 



(l/Q)Uj eMki) and , for a11 3 G {!;••• 5 TO }> ^ defined by (jL2"2")) is non- 
increasing. 



2.1 Reversibility 

We first prove that the transition matrix p defined by (|1.23[) and (|1.24j) is re- 
versible with respect to the measure v. Let r\ = (ki, . . . , k m ) € Xk,m \ v {0} 
and let i ^ j £ {1; • • • ; m}. We have p(?7, i] 1 ^) ^ if and only if v{jf^ ^ and, 
in that case, 



v Ufi\ e <Ai(fc*-l)+<Ai(fci+l) 



while 



so that 



= exp{Vr.^ + V+^} (2.1 



p(i] tj ,n) kj + 1 



exp{-[-0j(fcj) - ipiih - 1)] + 



+ [^ i {k i -l)-^ j {k j )] + } (2.2) 

= exp{^(fc l -l)-V J (fc J )+ln(fc J +l)-ln(fc 4 )} (2.3) 

= exp{V+_ 1 4 -V+^} (2.4) 

= exp{-V^-V+<^} (2.5) 

i/(r))p(r), rf ] ) — v{rfi )p(yfi ,r/). (2.6) 



2.2 A few words about the coupling method 

In order to upper bound the mixing time of the Markov chain with transition 
matrix p, we will use the coupling method. Given a Markov chain (X 1 ^ 2 ) on 
X-k.m x ^k,m, we say it is a (Markovian) coupling for the dynamics if both X 1 
and X 2 are Markov chains with transition matrix p. Given such a coupling, we 
define the coupling time r coup i c as the first (random) time for which the chains 
meet, that is 

rcoupic := inf{t > : X] = X 2 }. (2.7) 
In this work, every coupling will also satisfy the condition 

t > T coup ic => X\ = X 2 . (2.8) 

Then, it is a well-known fact that for all t > 0, 

d(t) < max P(r couplc > t\X% = n,X 2 = 9) (2.9) 

where d(t) is defined by (jl.lOj) . A proof of this fact as well as an exhaustive 
introduction to mixing time theory can be found in |14j . 

In the proof of both [1] and [2] we will build a coupling for which there exists 
a function p that measures in some sense a 'distance' between X\ and X 2 and 
from which we will get a bound on the mixing time thanks to the following 
proposition. 
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Proposition 2.2.1 Let (X 1 ^ 2 ) be a coupling for a Markov chain with tran- 
sition matrix p. We assume that the coupling satisfies the property H2.8\) . Let 
p : Xk.m x Xk,m N such that p{rj, 9) — if and only if n = 9. If M is the 
maximum of p and if there exists a > 1 such that, for all t > 0, 

E (p{X l t+1 ,X? +1 )\XlX?) < (l- I) p(X},X?), (2.10) 

then for all e > the mixing time t e of the dynamics is upper bounded by 
aln(M/e). 

Proof: Remark that (|2.10p actually means that (p(X^ , X 2 )(l — l/a)~*) te N is 
a super-martingale. Taking the expectation we get 

E(p(X} +1 ,X? +1 )) < (l " ~) E{p{XlXl)) (2.11) 

As a consequence 

E(p{X},X*))< (l-^j E(p(X^X^))<Me-i. (2.12) 

Since we assume p(r\. 9) — if and only if r] = 9 and (|2.8|) . 

P(r couplc > t) = P{p{XlX 2 t ) > 0) = P(p(X t 1 , X t 2 ) > 1). (2.13) 
From Markov's inequality and ([2.121) we deduce 

P(r CO upio >t)< E(p(X},Xf)) < Me"-. (2.14) 

Since the upper bound in (|2.14[) is uniform in Xq and Xq, according to (|2.9[) we 

get 

d(t)<Me~°. (2.15) 
Thus, given e > 0, if t > aln(M/e) then d(t) < e, so that t e < aln(M/e). □ 

2.3 A colored coupling 

In order to prove Theorem [TJ we introduce a dynamics on the set of the possible 
distributions of the k particles in the m energy levels. Therefore we define the 
set of the configurations of k distinguishable particles in m energy levels 

n:={w:{l;...;fc}->{l;...;m}} (2.16) 

and for every such distribution uj, we define = (fci, . . . , fc m ) € Afc, m by 

k 

V* G {1; . . . ; m}, fc, := ^ *{«(*)=*}• (2.17) 

x=l 
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We will couple two dynamics w 1 and ui 2 on ft, then we will work with the cou- 
pling (^(w 1 ), £(w 2 )). We will build the coupling (oj 1 ,^ 2 ) thanks to the coloring 
we now introduce. 

At step t, a red-blue coloring of {oj\,ujf) is a couple of functions C 1 , C 2 : 
{1; . . . ; fc} — > {blue; red} such that for all energy level i G {1; . . . ; m}, the number 
of blue particles in the level i is the same in both distributions, i.e., if \A\ refers 
to the cardinality of the set A, 

\{x : C^x) = blue,u)}(x) = i}\ = \{x : C 2 (x) = b\uc,uj 2 (x) = i}\ (2.18) 

and an energy level cannot contain red particles in both distributions 

C 1 (x)=red Vy , ( W 2 (y) = uj\ (x) C 2 (y) = blue), (2.19) 

C 2 {x) = red Vy, (w t x (y) = w 2 (:r) C 1 ^) = blue). (2.20) 

As a consequence of (|2.18|) , there exists a one-to-one correspondence 

$ : {x : C^x) = blue} -> {x : C* 2 (a;) = blue} (2.21) 

such that for all cc, w 2 ($(a;)) = uj}(x). Since the number of blue particles 
is the same in both distributions, so is the number of red particles. Moreover, 
£(wj) = £(w 2 ) if and only if all particles are blue. Then we are willing to provide 
a coupling (cj 1 , u 2 ) for which the number of red particles is non-increasing. This 
number does not depend on the red-blue coloring and at any time t we will call 
it p t . Writing ^wj) = (fcj,...,^) and £(cj 2 ) = {k 2 ,...,k 2 m ) we have the 
identities 

rn m m 

* = \1L \ k i k '\ = E^ 1 = E^ 2 - ( 2 - 22 ) 

i—1 i—1 i—1 

We are now ready to build our coupling (u; 1 ,^ 2 ). Given (uj},u> 2 ) and a red- 
blue coloring (C 1 , C 2 ) at step t (such a coloring certainly exists for any couple 
of distributions w 2 )), let x 1 be a uniform random integer in {1; . . . ; fc}. 

• If C 1 (a; 1 ) = blue: then we set x 2 = ^(x 1 ) where $ is provided by (12.211) . 

• If C 1 (a; 1 ) = red: let x 2 be a uniform random integer in {x : C 2 {x) = red}. 

Lemma 2.3.1 The random integer x 2 has a uniform distribution on the set 
{!;... ;k}. 



13 



Proof: We write 



fc 



P(x 2 = y) 



yjx 1 = x)P(x l = x) 



(2.23) 



E * 




x:C 1 (a;)=blue 



+ E 



x-.C 1 (a:)— red 



fl-{C 2 (a)=rod} 1 

Pt k 



(2.24) 



H-{C 2 (i/)=bluc} 



+ 



l{C 2 (y)=rcd} Pt 
Pt k 



(2.25) 



1 

k' 



(2.26) 



□ 



We then choose an energy level j uniformly in {!;...; m} and we write £(W() = 



(hi,..., hi) and^ 2 )^ 2 ,...,^). 

• If C 1 (x 1 ) = blue: then C 2 (x 2 ) = blue, and x 1 , x 2 are in the same energy 
level i := ujjix 1 ) = ujf(x 2 ). Let a ^ b e {1;2} such that ipi(kf - 1) - 
< - 1) - V^(fc-)- Then, 



Let [/ be a uniform random variable on [0; 1). 

-lfU< Pb- then we set ujf +1 (x) = ujf(x) for all x ^ x a , w t a +1 (a; a ) = j, 
u b t+l {x) = uj b t {x) for all x ^ x b and w t fc +1 (a; b ) = j. Then £(w t a +1 ) = 
and £(u>t+i) = (^( a; t))*' 7 ) an d f° r an y red-blue coloring of 
(wL_x> ^t+i) the number of red particles pt+i remains the same (since 
both particles x 1 and x 2 have moved together). 

— If Pb < < Pa- then we set w° +1 (a;) = for all x ^ x a , 
ujf +1 (x a ) = j and w t b +1 (a;) = w t fc (a;) for all at. Then £(w t Q +1 ) = 

))*■?' and £( w t+i) = C( w t)- The only situation in which the 
number of red particles could increase is the following: k°- > k b 
and fe° < /c b . Since ipi and ^ are non-increasing, this would imply 
Pa < Pb that contradicts Pb < U < p a - Then, p t +\ < pt for any 
red-blue coloring of (uj} +1 ,oj 2 +1 ). 

— If U > p a : then we set = and = lo\. Then ^(w t a , t ) = 
£«), ^ h t + i) = ZH) and Pt+1 = p t . 

• If C 1 (x 1 ) = red: then C 2 (x 2 ) — red, and we define i 1 :— ljKx 1 ) and 
i 2 := lj 2 (x 2 ). Note that according to ([2T9]) . i 1 ^ i 2 . Let then y 1 ,^ 2 be 
two independent uniform random variables on [0; 1). 



Pa 



:= exp{-MK-l)-^m)] + } 

> eX p{-[^(fe b -l)-^(fc 3 6 )]+} =: p 6 . 



(2.27) 
(2.28) 
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- If V 1 < exp{-[r/>ii(fc?i - 1) - ipj(kj)] + } then we set u\ +1 (x) = u\{x) 
for all x ^ x 1 and w^^x 1 ) — j and then = (£(0^))^; 
otherwise we leave = coj. 

- If V 2 < exp{-[V' i 2(fc i 2 2 - 1) - ipj(tf)} + } then we set uj 2 +1 (x) = lo 2 (x) 
for all x ^ x 2 and w 2 +1 (:r 2 ) = j and then £(w 2 +1 ) = (^(w 2 ))*- 7 ; 
otherwise we leave to 2 +1 — uj 2 . 

Whether red particles move or not, the number of red particles cannot 
increase, so it is clear that p t +i < pt- 

We conclude 

Proposition 2.3.2 (pt)teN is a non-increasing process. 
and claim 

Proposition 2.3.3 The process (^(w 1 ), £(w 2 )) is a coupling for the Markov 
chain with transition matrix p. 

Proof: Writing £(uj}) — r\ and given i, j <E {1; . . . ; m}, 

p(^K 1 +1 ) = ^) 

= P(€(wl +1 ) = rf^&ix 1 ) = bluc)P(C 1 (x 1 ) = blue) 

+ P(eKVi) = rf^&ix 1 ) = rcd)P(C 1 (a; 1 ) = red). (2.29) 



We have 
and 



PiC^x 1 ) = red) = 1 - Pi&ix 1 ) = blue) = £ (2.30) 



P(^l +1 ) = r ] i3 \C 1 (x 1 ) = blue) 

= P(w t 1 (a; 1 ) = ilC 1 ^ 1 ) = blue) x -j- x P(t/ < Pl ) (2.31) 

= (fc - pt ) m ^-^fe 1 - 1) - (2.32) 

P(^ t 1 + i) = r? ij |C 1 ( a; 1 )=red) 

= PK'fx 1 ) = zlC 1 ^ 1 ) = red) x — x P(F 1 < Pl ) (2.33) 

m 

J " /,Tl+ exp{-[^-l)-^-(fei)] + } (2-34) 



^7 \ n i , 



which finally leads to 



P(£KVi) = = ^ exp{-[^(fc? - 1) - = p{v, V ij )- (2-35) 



15 



Then, P(^(uij +1 ) = rfi \uij , wf) depends on^{uj}) only, which means that £(w x ) is 
a Markov chain. Besides, according to f|2 . 35[) its transition matrix is p. Finally, 
by Lemma T2. 3. 11 the same is true for £(oJ2 )• O 

From now on, we will write X\ := ^(uil) and X 2 :— £(w 2 ). The previous 
proposition ensures that (X 1 , X 2 ) is a coupling for the dynamics with transition 
matrix p. 

2.4 Estimating the coupling time 

We will use Proposition 12.2.1] Since (p t ) tg N is non increasing and pt = if and 
only if X\ — X 2 , all we have to do is to estimate from below the probability of 
the event {p t +i < Pt}- 

Proposition 2.4.1 // at step t of the coupled dynamics (<jj l ,u) 2 ) we assume 
that red particles have been chosen, i.e., C 1 ^ 1 ) = red = C 2 (x 2 ), then there 
is a choice of j £ {i l ;i 2 } for which the number of red particles decreases with 
probability 1, where i 1 (resp. i 2 ) still refers to u>l(x ) (resp. lu 2 (x 2 )). 

Proof: If the inequalities 

eJtp{-[Vii(fc£-l)-Mfci»)] + } < 1 ( 2 - 36 ) 
exp{-[^ 3 (4-l)-^(4)] + } < 1 ( 2 -37) 

hold together, then i/v {k\i — 1) > V^ 2 an d ^i 2 (k% — 1) > (k 2 i)- Besides, 
since C 1 (x 1 ) = red, according to (|2.19|) k\x > k 2 x from which we get k\i — 1 > 
fc 2 ! and, since is non-increasing, tp^fjth — 1) < ^i(fc 2 i). Likewise, since 
C 2 (x 2 ) = red we have 1/^2 (& 2 2 — 1) < 1/^2 (/c^). We finally may write 

VV (4 - 1) > ^2 (4 ) > ^ (fc 2 - 1) > ^1 (4 ) > ^ (4 - 1) (2.38) 

which is absurd. As a result, either (|2.36[) or (|2.37[) is false. For instance 
let us assume that (|2.36[) is false, then if j = i 2 , with probability 1 we have 
uj\ +1 {x) = ojj(x) for all x ^ x 1 , w t 1 +1 (a; 1 ) = i 2 7^ ^(x 1 ) and w 2 +1 = w 2 . Then, 
the number of red particles for any red-blue coloring of (ujj +1 ,oj 2 +1 ) is exactly 
Pt+i = Pt - 1- □ 

It follows 

Corollary 2.4.2 At step t, if pt > 0, the probability for pt+i to be p t — 1 is at 
least pt/km. 

Consequently, and owing to the fact that pt cannot increase, we have the 
inequality E(p t+ i - pt\X$,Xf) < -P(p t+ i = p t - l\Xl,X$) from which wc 
deduce 

(2.39) 



E{p t+1 \XlX 2 )< (l-±-) Pt . 

\ km J 



Then we can apply Proposition 12.2. II to p(X},X 2 ) = p t with M — k and 
a = km, which finally proves Theorem [TJ 
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Remark: Adding "moderate disorder hypotheses" we can gain a lot, depending 
on the specific model we consider. For example in the case of our simple exclu- 
sion dynamics, i.e., for the Markov chain associated with the Fermi statistics 
when all the n/s are equal to 1, we can gain a factor of order m. We can indeed 
assume that k/m < 1/2 (if not, we consider a dynamics on the vacancies rather 
than on the particles as mentioned in Section fO} and, if, for example, v derives 
from an homogeneous product measure, i.e., ipi = ipj for all i and j, then there 
are much more than one choice for j for which the number of red particles will 
decrease with probability 1 when red particles were chosen: any of the more 
than m/2 vacant sites in X\ or X 2 will do the job. In this case we gain a factor 
m/2, first in Corollary 12.4.21 then in the final estimate. More generally, if the 
functions \ipi — ipj\ are uniformly bounded (this is a moderate disorder hypoth- 
esis) , then, when red particles are chosen together with one of these more than 
m/2 vacant sites, the probability that the number of red particles decrease is 
bounded away from zero: once again we gain a factor of order m. 



3 Proof of Theorem 2 

We now work with the dynamics defined by (ll.35[) - (|1.36[) and we assume 5 > 0. 
First, the reversibility of this dynamics with respect to v still holds, with exactly 
the same computation as in Subsection 12. II However, it is no longer possible to 
work with an underlying process u t £ Q since the factor kf cannot stand for a 
number of particles as soon as S < 1. Therefore we need to adapt the coupling 
(X 1 , X 2 ) directly on X k>m . 



3.1 Generalizing the previous coupling 



At step t, let us assume X\ — {k\, . . . , 


kl n ) e X k ,m and X 2 


%k,m- We define the following sets: 




Ri := {ie{l;. 


.;m} : k\ > k 2 }, 


i? 2 := {*g{1;. 


■ ;m}:k 2 > kj}, 


B := {*G{1;. 


. ;m} : k} = fe?}, 


and the following quantities: 




Wl := Y, 


-i:=E( fc ') 5 ' 


ieRi 


iER-2 




«4 :-E(^) 5 ' 


ieR.2 


ieRi 


w B :=5>l) 


5 = Y( k f) 5 - 


ieB 


ieB 


Finally wc define 




m m m 

Pt :=iEi fc i- fe ii = E^- fe "] + = E[ fc '- 

2—1 i—1 i—1 



[k\, . . . , k m ) € 

(3-1) 
(3.2) 
(3.3) 

(3.4) 
(3.5) 
(3.6) 

|+. (3.7) 
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Keeping in mind the previous coloring, R\ (resp. R 2 ) is the set of sites in 
which there are red particles for the first (resp. the second) configuration, B is 
the set of sites in which there are only blue particles or no particles for both 
configurations, w\ (resp. w 2 ) is proportional to the probability to choose a site 
for the first (resp. the second) configuration in which there are red particles, w'± 
(resp. w 2 ) is proportional to the probability to choose a site for the first (resp. 
the second) configuration in which there are only blue particles while there are 
red particles in the second (resp. the first) configuration, u>b is proportional 
to the probability to choose a site in which there are only blue particles for 
both configurations, and Is — (wb + Wi + w[) (resp. Is — (wb + w-i + w 2 )) is 
proportional to the probability not to choose any site for the first (resp. the 
second) configuration, and it can be positive as soon as S < 1. Finally, p t still 
stands for the number of red particles and it is clear that p t = if and only if 

N.B. In the remaining part of this subsection, we assume w\ + w[ > w 2 + w 2 
in order to not overload the notations and not increase the number of cases to 
investigate. Obviously, the case Wi + w[ < w 2 + w 2 is exactly symmetric. 

Let / be a uniform random variable on [0; Is). 

(i) If / < Wb- then there exists a unique i e B such that 

E E (3-8) 

and we set i 1 = i 2 = i. 

Remark: We then have, for alH e {1; . . . ; m}, 

pa^m = % es} ^, (3.9) 

WB 

p(i 2 = m) = i {jeB} ^ = i {teB} ^. (3.10) 

(ii) If wb < I < + w[: then there exists a unique i 6 R2 such that 

E {kl) 5 <I-w B < E ^') S ( 3 - n ) 

and we set i 1 = i 2 = i. 

Remark: We then have, for all i e {1; . . . ; m}, 

P(i i =i\(n)) = l { ,eK 2} ^, (3.12) 
P(z 2 = z|(ii)) = l {J eK 2 }^. (3.13) 
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(iii) If wb + w[ < I < Wb + w'\ + w\\ then there exists a unique i E i?i such 
that 

E (kif <i-w B -w[< J2 ^') S - ( 3 - 14 ) 

We set i 1 — i and we define u := I — wb — w[ — X)i'efli i'<i(^i / ) <5 i so that 
< u < (kj) s . Notice that since i E Ri, kj > kf. If u < (k 2 ) s then we 
set i 2 = i. Otherwise, for all i 1 E R 2 we write ki> :— J2ieR 2 -£<i'(^ S an< ^ 
we denote by T the disjoint union of intervals 

T -=\ U ^ + {k} l ) s -k ll + {kl) s ) \ U[w 2 +w' 2 ;wi+w[). (3.15) 

Let /' be a uniform random variable on T. If there exists i' E R 2 such 
that V E \ki> + {k},) 5 \ki> + {kf,) 5 ) then we set i 2 = i'. Else we do not 
define i 2 . 

Remark: We then have, for all i E {1; . . . ; m}, 

P(i 1 =i|(iii)) = l {iefll} iM]! (3.16) 

and for all i' E {1; . . . ; m}, 
P(i 2 =i'|(iii)) 
1 

y (kl) S -(kf) s {k 2 ,) s -(klY 

+ ^'^ 2 }E x MX) (3J7) 



where 



A(T) := ]T ((fc 2 ,) 5 - (4)*) + (wi + K) - (w 2 + w> 2 ) (3.18) 
= wi-w 2 (3.19) 



so that 



P(z 2 = i'|(iii)) = ^enj-^ + %^} (fc '' ) '„ {k] ' )S - ( 3 - 2 °) 



Wl 1 ' Wl 



(iv) If / > + wi + w[ (this case cannot occur when 5 = 1): then we do not 
define i 1 and i 2 . 

Before going ahead with the definition of our coupling we note, as a direct 
consequence of our remarks in (i), (ii), (iii) and of of the fact that / has a 
uniform distribution: 



19 



Proposition 3.1.1 For all i e {l;...;m}, Ptf 1 = i\X},X?) = {kj) 5 /l s and 
P(i*=i\XlX?) = {k1) s /l 5 . 

We then choose an integer j <G {1; . . . ; m} with uniform law and we distinguish 
once again between our four previous cases. 

(i) If i 1 e B: then i 2 = i 1 , we just write i 1 = i 2 = i. Then h\ — kf. Thus, 



let a, b e {1; 2} such that a^b and kj < k b y Since both ipf^ and ip l P ar e 



non-increasing, 

p a := exp 
> exp 



V^-i)-^ 1 ^) 

^{k\-l)-i,f{k)) 



=■■ Pb- 



(3.21) 
(3.22) 



Let U be a uniform random variable on [0; 1). 

- If U < Pb : we set - and X b t+l = (X^ . 

- lip h <U < p a : we set Xf +1 = {X^ and X b t+1 = X b t . 

- If pa < U: we set X? +1 = X? and X b t+1 = X h t . 

In any of these cases, we certainly have p t +i = pt- 



(ii) If i 1 £ i?2: then i 2 — i 1 , we just write i 1 



that a ^ b and ^f J (fcf - 1) - Vf(^) < ^f\k\ - 1) - V/ J (*#), so that 



i. Let a, b e {1; 2} such 
-.[«]/ 



Pa 



:= exp 
> exp 



Pb- 



(3.23) 
(3.24) 



Let U be a uniform random variable on [0; 1). 

-UU< p h : wc set Xf +l = {X?f3 and X b +l = (X b )^ . 

- If p h < U < p a - we set Xf +l = (X?)v and X b +1 = X h t . 

- H Pa < U: wc set X? +1 = X? and X b +1 = X\. 

In the last case we obviously have pt+\ — Pt- In the first case the particles 
move together and p t +i = Pt- In the second case the number of red 
particles could increase only if kf < k\ and fc" > k b , but, since tpf^ and 

ip^ are non increasing, this would contradict p a > Pb- As a consequence 
we have p t +i < pt in all the three cases. 
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(iii) If i 1 £ i?i: there are three cases for i 2 . Either i 2 = i 1 = i and this case is 
the symmetric of (ii). Or i 2 is randomly chosen in R 2 , and we define 



Or else i 2 is not defined, and we set p 2 ■= still defining p\ by (|3.25|) . Let 
then V 1 , V 2 be independent uniform random variables on [0; 1). 

- If V 1 < pi: we set Xj +1 = (Xjfi, else we set Xj +1 = X\. 

- If V 2 < p 2 : we set X 2 +1 = (X 2 )^ 3 , else we set X 2 +1 = X 2 . 

In the first case we have pt+i < Pt as previously. In the last two cases we 
also have pt+i < Pt since only particles from R\ in the first configuration 
and from R 2 in the second configuration can move. 

(iv) If i\ and i 2 are not defined: then we simply set (X£ +1 , Xf +1 ) = (X^,X 2 ) 
and we have pt+i — Pt- 

In any of the previous cases, once i , i 2 and j have been defined, the prob- 



ability for X} +1 (resp. X 2 +1 ) to be {X^i (resp. (X 2 ) 1 ^) is exp{-[^ ] (kj, - 
1) - (fc) )] + > ( resp. exp{-[^f ] (4 - 1) - *pf ] (k 2 )} + }). Thus, according to 



Proposition I3.1.T1 the fact that j is uniformly chosen in {l;...;m} and our 
study on the variation of p we conclude: 

Proposition 3.1.2 The process (X 1 , X 2 ) is a coupling for the dynamics defined 
by il.35]) - [1.36]) and such that (p t ) t£ N is non-increasing. 

3.2 Estimating the coupling time 

Similarly to the proof of Theorem Q] we will use Proposition 12 . 2 . ll since (pt)teN 
is non-increasing it will be enough to give a lower bound for the probability of 
{pt+i < pt}. 

Proposition 3.2.1 If at step t of the coupled dynamics (X 1 , JT 2 ), we assume 
that "red particles have been chosen", i.e., i 1 £ R\ and i 2 £ R 2 , then, there 
is a choice of j £ {i l ]i 2 } for which the number of red particles decreases with 
probability 1. 

Proof: Assuming i 1 £ R\ and i 2 £ R 2 yields fcA > k 2 x and kh < k 2 2 . Us- 
ing exactly the same argument as for Proposition 12.4.11 we prove that either 



cxp{-[V# J (A£ -1)-V# J (*&)]+} = 1 or exp{-[^ J (4 - 1) - ^ (*£)]+} = 1. 



Eventually, if one red particle in some configuration moves to a site with a red 
particle in the other configuration, then both particles turn blue and the number 




(3.26) 



(3.25) 



of red particles decreases by one. 



□ 
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Corollary 3.2.2 At step t, the probability for pt+i to be pt — 1 is at least 
5k & - l Pt /ml 5 . 

Proof: The probability to choose i 1 £ R\ and i 2 £ R2 is 
Pii 1 £ Ri,i 2 £ R 2 ) 

= ^2 ^Pii 2 = 1'^ = i)x = 1) 



i'£R,2 i£Ri 

E E 

j'ei?2 iefii 



1\5 



(kir a(t) 



Is 



(3.27) 
(3.28) 
(3.29) 



Since, for any concave function / : K + — > M and any s £ N \ {0}, (z\, . . . , z s ) £ 
M.+ 1 — ^ f(J2i z i) ~ Tli f( z i) i s non-increasing in all its s variables (as a conse- 
quence of the slope inequalities), by concavity of z M> z 5 and using the fact that, 
for all i' £ R2 , kf, > kj, we get 



Pit 1 £ R u i 2 £ R 2 ) > - 

Is 



1 

h 



E *?) - ( E 



p* + E 4 

i'eii 2 



i'e-R.2 



(3.30) 
(3.31) 



Using the same property of concave functions on ]R + (with s = 2) and the fact 
that X)i'e_R — k ~ Pt> then using once again the concavity of z i-> z s , we 
write 



P(i J £ R U t 2 £ R 2 ) > £ [k s -(k- Pt ) s ] > 



so that, by the previous proposition, 



P(p t+1 =p t -l\Xl,X?)> 



dk^pt 
mis 



As a consequence 



E(p t+1 \X t \Xf)< 1 



km(k A m) 1 5 

and, by Proposition 12 . 2 . il this finally proves Theorem [21 



Pt 



(3.32) 

(3.33) 
□ 

(3.34) 



A Simulation appendix 

In this appendix we report on simulations for the temporal evolution of d(t) 
defined in (|1.10p as well as the dependency in (3 of the mixing time t e defined 
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in (|1.9p with e = 0.1 in the context of the Fermi statistics v associated with 
k = 50 particles among m = 20 energy levels with two different profiles. 

• In the first case we chose Vj = j/20 and rij — 2 J for 1 < j < m so that 
n = J2j V = 2 21 - 2. See Figure [TJ 



Figure 1: Degeneracies of the energy levels for case 1, in semi- logarithmic rep- 
resentation for the second picture. 

• In the second case we chose the same Vj's, the same volume n = 2 21 — 
2 and we chose the degeneracies according two a multinomial law with 
parameters n, 1/20, . . . , 1/20. See Figure[2j 







ill 









Figure 2: Degeneracies of the energy levels for case 2, with shifted and rescaled 
axis for the second picture. 



We were then interested in 



d(t)= max \\p t (r}o,-)-v\\TV= max ~ V |p*(jjo, 77) - v{rj)\ (A.I) 

for t > 0. The cardinality of X^ m being very large (equal to Ug) ~ 5 x 10 16 in 
case 2), one faces three difficulties with such a formula: 
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i) One cannot compute 1/(77) an d P w)j 7 7) for given 77 and 770. 

ii) The sum in (|A. 1|) contained too many terms to be computed. 

iii) It is not possible to try all the possible 770 before taking the maximum 
in (TO) . 

We addressed the first difficulty by replacing (770, •) by the empirical measure 

^■■=nY, S xI ( A - 2 ) 

1=1 

computed with a large number N of simulated independent copies (X|) te pj of 
our Markov chain starting from 770 and, using Theorem [U by replacing v by 

2T 

N 1 > N 1 " N ( A o\ 

v := 2T ^ Mt Mt ^ ' 

t=T+l 

with T := Lfc77iln(£;/e)J and where fi' t N and /i"^ are independently computed 
like ji^ , starting from two different configurations. We addressed the second 
difficulty by looking at a coarse grained version Xk, m of Xk, m through the free 
entropy (j). More precisely, with I N = {<p- , 4>+] the smallest interval containing 
the support of <t>{v N ) = v N o we divided / into M intervals of length 
A = (<fi + — <j)^)/M, we extended this partition of I into a partition of R in 
intervals of length A and grouped in a same class the configurations 77 6 X^ m 
that fall into a same interval. (For possible cases where 0_ =</>+, we used a 
partition of R into intervals of length 10~ 6 .) With <f> the canonical projection 
from Xk.m to Xk,m we could then compute \\4>(^) — 4>{v n )\\tv rather than 
- v N \\ TV . Note that 

U(^) - 4>{v n )\\tv < ||Mf - i^Hrv, (A.4) 

and we also have, for large M and as soon as tfi is an injection from Xk, m to R, 

||/xf - v N \\ TV = H^/if ) - 4>{v n )\\tv + (A.5) 

As far as the third difficulty was concerned we underestimated 

d N (t)= max U(^)-$(v N )\\ TV (A.6) 

by making a guess on a particular configuration 770 for which — <1)(v n )\\tv 

could be of the order of d N (t). We simply took an 770 for which we could have 
say, for all (3, that it was "very far from typical configurations" . We first took 
770 = (0, . . . , 0, fc) since typical configurations are concentrated on the low energy 
levels when f3 is large and have their occupation numbers kj distributed like the 
degeneracies rij when /3 is small. Since this last point tends to show that in 
case 1 our specific 770 is "not so far" from typical equilibrium configurations (we 
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will come back on this point later), we also considered a different 770, that one for 
which the high energy levels are empty and the low energy levels are saturated 
or as close to saturation as possible: 



'/() 



(2,4,8, 16, 20,0,..., 0) in easel, 
(50,0,...,0) in case 2. 



(A.7) 



Then we simply took the maximum of the two quantities \\(f>(fj,^) 
computed, in each case, with these two specific choices. 
Summing up, we approximated d(t) by 



(^)ll 



TV 



max ||0( M f)-^K)|| xv 



(A., 



where X® m is the two configuration set described above. We chose N — 1024 

and M — \/N = 32 and we plotted for different temperatures a graphical 
representation of the law of (f)(v N ) and the temporal evolution of d$ it) for 
< t < 2T. See Figures [3] to for case 1 and Figures 181 to IT2"1 for case 2. 



Figure 3: Probability distribution function of 4>iy ) and temporal evolution of 
dy it) for case 1 and (3 = 0. 



Figure 4: Probability distribution function of 1 
dg it) for case 1 and (3 = 4. 



) and temporal evolution of 
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Figure 5: Probability distribution function of (j)(v N ) and temporal evolution of 
cIq (i) for case 1 and (3 = 16. 



Figure 6: Probability distribution function of (j)(v N ) and temporal evolution of 
(1q (t) for case 1 and (3 — 63. 
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Figure 7: Probability distribution function of (j)(v N ) and temporal evolution of 
dg{t) for case 1 and (3 = 1000. 
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Figure 8: Probability distribution function of <j){v N ) and temporal evolution of 
(1q (t) for case 2 and (3 = 0. 
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Figure 9: Probability distribution function of 4>(v N ) and temporal evolution of 
d${t) for case 2 and (3 = 4. 



Figure 10: Probability distribution function of <f>{v N ) and temporal evolution 
of d~Q (t) for case 2 and (3 = 16. 
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Figure 11: Probability distribution function of <t>{v N ) and temporal evolution 
of d,Q (t) for case 2 and /3 = 63. 




Figure 12: Probability distribution function of <t>{v N ) and temporal evolution 
of d$(t) for case 2 and (3 = 1000. 

For each temperature we get in each case an estimation t e of t e . Estimating 
t e in this way for 51 temperatures (y3 = included) we show in Figure [T51 the 
graphical representations of the simulated dependency of t e on j3 > together 
with our upper bound from Theorem [TJ 

We can now conclude with a few comments. First, with such a procedure, 
t e is essentially underestimated. Indeed, except for the replacement of £>*(??o> ■) 
and v by ^ and v our successive approximations underestimated d(t) then 
t t . As a consequence we recover with case 2 that km\n(k/e) is the best upper 
bound on t e that is uniform over the disorder. 

Second, we note that (3 can be seen as a tuning parameter for the disorder. 
We recover that our theoretical estimate on t e can be improved by a factor of 
order m in weak disorder situations. 

Last, it is worth to note that case 1 suggests that t e is not generally decreas- 
ing with the temperature. In our preliminary simulations we only considered 
rjo = (0, . . . , 0, 50) to estimate the mixing time. Then we feared that our non 
monotonic estimation could be an artefact of our simulation. This led us to 
introduce the alternative saturated low level starting configuration, confirming 
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Figure 13: Dependency of t £ on j3 in case 1 and 2 respectively (semi-logarithmic 
representation). The horizontal line stands for our theoretical upper bound. 

in that way the non-monotonicity. 
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